De novo assembly of the common marmoset transcriptome from NextGen mRNA sequences

نویسندگان

  • Mnirnal D Maudhoo
  • Dongren Ren
  • Julien S Gradnigo
  • Robert M Gibbs
  • Austin C Lubker
  • Etsuko N Moriyama
  • Jeffrey A French
  • Robert B Norgren
چکیده

BACKGROUND Nonhuman primates are important for both biomedical studies and understanding human evolution. Although research in these areas has mostly focused on Old World primates, such as the rhesus macaque, the common marmoset (Callithrix jacchus), a New World primate, offers important advantages in comparison to other primates, such as an accelerated lifespan. To conduct Next Generation expression studies or to study primate evolution, a high quality annotation of the marmoset genome is required. The availability of marmoset transcriptome data from five tissues, including both raw sequences and assembled transcripts, will aid in the annotation of the newly released marmoset assembly. FINDINGS RNA WAS EXTRACTED FROM FIVE TISSUES: skeletal muscle, bladder and hippocampus from a male common marmoset, and cerebral cortex and cerebellum from a female common marmoset. All five RNA samples were sequenced on the Illumina HiSeq 2000 platform. Sequences were deposited in the NCBI Sequence Read Archive. Transcripts were assembled, annotated and deposited in the NCBI Transcriptome Shotgun Assembly database. CONCLUSIONS We have provided a high quality annotation of 51,163 transcripts with full-length coding sequence. This set represented a total of 10,833 unique genes. In addition to providing empirical support for the existence of these 10,833 genes, we also provide sequence information for 2,422 genes that were not previously identified in the Ensembl annotation of the marmoset genome.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Clustering of Short Read Sequences for de novo Transcriptome Assembly

Given the importance of transcriptome analysis in various biological studies and considering thevast amount of whole transcriptome sequencing data, it seems necessary to develop analgorithm to assemble transcriptome data. In this study we propose an algorithm fortranscriptome assembly in the absence of a reference genome. First, the contiguous sequencesare generated using de Bruijn graph with d...

متن کامل

De novo assembly of the chimpanzee transcriptome from NextGen mRNA sequences

BACKGROUND Common chimpanzees (Pan troglodytes) and bonobos (Pan paniscus) are the species most closely related to humans. For this reason, it is especially important to have complete and accurate chimpanzee nucleotide and protein sequences to understand how humans evolved their unique capabilities. We provide transcriptome data from four untransformed cell types derived from the reference Pan ...

متن کامل

Qualitative De Novo Analysis of Full Length cDNA and Quantitative Analysis of Gene Expression for Common Marmoset (Callithrix jacchus) Transcriptomes Using Parallel Long-Read Technology and Short-Read Sequencing

The common marmoset (Callithrix jacchus) is a non-human primate that could prove useful as human pharmacokinetic and biomedical research models. The cytochromes P450 (P450s) are a superfamily of enzymes that have critical roles in drug metabolism and disposition via monooxygenation of a broad range of xenobiotics; however, information on some marmoset P450s is currently limited. Therefore, iden...

متن کامل

TransRate: reference-free quality assessment of de novo transcriptome assemblies.

TransRate is a tool for reference-free quality assessment of de novo transcriptome assemblies. Using only the sequenced reads and the assembly as input, we show that multiple common artifacts of de novo transcriptome assembly can be readily detected. These include chimeras, structural errors, incomplete assembly, and base errors. TransRate evaluates these errors to produce a diagnostic quality ...

متن کامل

De novo transcriptome assembly of heavy metal tolerant Silene dioica

Silene dioica is a dioecious plant of the family Caryophyllaceae. In the present study, we used Illumina sequencing technology (MiSeq) to sequence, de novo assembly and annotate the transcriptomes of male and female copper tolerant S. dioica individuals. We sequenced the normalized mRNA of roots, shoots, flower buds and flowers for each sex. Raw reads of the transcriptome assembly project for S...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2014